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EXPRESSION OF G-CSF AND MUTEINS THEREOF AND THEIR USE 

The Invention concerns the field of recombinant DNA. In 
particular, the invention concerns the Isolation of nucleic acid 
sequences encoding granulocyte colony-stimulating factor (G-CSF) and 

5 the expression of G-CSF 1n various host cells. The Invention concerns 
modified DNA sequences encoding G-CSF and mutelns thereof that are 
efficiently expressed under the control of promoters operable 1n E. 
coH, expression vectors utilizing the modified DNA sequences for the 
efficient expression of G-CSF 1n E. coll and Gram-negative hosts known 

10 to exchange genetic material with E. coll, host organisms including 
microorganisms transformed with the expression vectors and G-CSF 
produced thereby. 

Interest 1n the growth and differentiation of granulocytes 
and macrophages from bone marrow progenitors has prompted the study of 

15 the colony-stimulating factors (CSFs) that regulate these processes. 
The subject has been recently reviewed by D. Metcalf 1n Blood, 67; 257 
(1986). One of these factors, termed granulocyte-CSF (G-CSF) 
stimulates the 1n vitro growth of predominantly granulocyte colonies 
from bone marrow stem cells. The availability of cell lines that 

20 produce CSFs has been Instrumental 1n the characterization of G-CSF; 
however, proper Interpretation of this work requires the precise 
Identification of the CSF released by these cells. One such cell 
line, the MIA PaCa-2, derived from a human pancreatic carcinoma, 
reportedly produces both the macrophage-speclfic CSF-1 and a CSF that 

25 can Induce the formation of primarily granulocyte colonies In seven 
day bone marrow cell cultures. Wu, M. C, J. CUn. Invest., 67:1588 
(1981). A CSF-1 cDNA clone has been recently Isolated from this cell 
line, Kawasaki et al., Science, 230:291 (1985). They also showed, by 
translation of sucrose gradient fractionated mRNA 1n Xenopus laevis 

30 oocytes, that a transcript that sedlmented slower than the 18s marker 
encoded a CSF that Induced the proliferation of murine bone marrow 
cells; however, the CSF-1 clone hybridized most strongly to mRNA 1n 
the fractions that sedlmented faster than the 18s marker. 
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Human G-CSF causes the differentiation of granulocytes and 
monocytes from bone marrow progenitor cells. It also Induces some 
leukemia cells to differentiate Into more mature cell types. Thus, 
human G-CSF may be useful 1n regeneration of Important blood cell 

5 populations 1n Immunologically compromised patients and patients with 
anemic disorders or Infections. G-CSF may also be useful 1n 
Inhibiting leukemia by causing leukemia cells to differentiate to a 
non-dividing state, thereby checking the uncontrolled proliferation of 
these cell types. Myeloid leukemia cells are known to have receptors 

10 for G-CSF, for example. Science, 323: 7 (1986). 

Souza et al. have Isolated, cloned, and expressed a gene 
encoding G-CSF, which has no significant homology to GM-CSF, in an 
unidentified strain of E. coll using a modified temperature sensitive 
runaway plasmld with a x phage PL promoter and synthetic DNA fragment 
15 containing an Initiation codon, followed by the sequence encoding the 
mature form of unaltered G-CSF. No data on expression levels are, 
however, disclosed. Souza et al., Science, 232 :61-65 (1986). 

In addition, a colony stimulating factor having the same N- 
terminal 20 amino acids as that of Souza et al. was Isolated from a 

20 cultured human oral carcinoma as disclosed in European Patent 
Publication 0169,566 to Nagata et al. This clone designated CHU-2, 
has an additional sequence encoding Val-Ser-Glu just prior to the 
cys35 residue of the protein encoded by the DNA sequence cloned from 
MIA PaCa-2, as described herein below. No Information 1s given in 

25 either of the above-mentioned disclosures as to the levels of 
expression in recombinant hosts, parti culaly E. coli. Nagata et al. 
subsequently reported a cDNA sequence that lacks the sequence encoding 
the above-mentioned additional tr1-pept1de sequence. Nagata et al., 
EMBO Journal 5:3, 575-581 (1986). 

30 Reportedly, both of these recomblnantly produced G-CSFs have 

NH 2 terminal methionines as a result of expression In the recombinant 
bacterial host E. coll. Such NH 2 -term1nal methionines are generally 
known to be immunogenic when the protein 1s administered parenteral ly 
to a mammalian host. 
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A number of factors are Involved 1n obtaining maximal 
expression of a protein 1n a recombinant host. Among these factors 
are optimization of the physiological conditions for growth of the 
host cell by changes to the culture media 1n which the host grows. 
5 Also Important Is the selection and use of strong promoter and 
Hbosome binding sites (RBS) that are operable 1n the expression 
vector to be used. The elimination of attenuator regions 1f they 
occur 1n the region of regulated promoters such as the tryptophan 
operon 1s also desirable. 

10 A more subtle factor 1n maximizing protein expression 1n a 

recombinant host relates to codon utilization. In brief, codon 
utilization may be described as the ability of a particular 
recombinant host organism to translate some codons encoding a 
particular amino acid more quickly and accurately than other different 

15 codons encoding the same amino acid. Thus, although the trinucleotide 
code specifying a particular amino add 1s degenerate, i.e., more than 
one trinucleotide sequence can encode a particular amino add, all 
codons encoding a particular amino add are not utilized by the host 
with equal efficiency. Enhanced expression may be obtained by 

20 changing the nucleotide sequence of a gene 1n a recombinant host to 
the host preferred codons without changing the amino add sequence of 
the protein encoded thereby. Additionally, changes in the amino acid 
sequence of the NH 2 terminus may effect processing of the NH 2 - terminal 
methionine mentioned above. 

25 Another subtlety 1n the enhancement of gene expression 1n an 

expression vector 1n a recombinant host relates to secondary structure 
formation in the mRNA sequence to be expressed. It has been shown, 
for example, that certain "stem and loop" secondary structures placed 
in proximity to the 3 1 terminus of a gene, lead to increased stability 

30 of the mRNA transcript and increased expression of the protein encoded 
thereby, Wong et al., Proc. Nat. Acad. Scl. (USA), 83:10, 32-33 
(1986). However, It has been suggested that formation of stable 
secondary structures at the 5 1 end of an mRNA transcript may block 
efficient translation thereof, leading to reduced yield of the protein 

35 encoded by the gene. The Influence of mRNA secondary structure on 
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translation has been discussed 1n Iserentant & Flers, Gene. 9,: 1-12 
(1980) and Buell et al., Nucleic Acid Research, 13_: 1923-1938 (1985). 
DeLamarter, J. F., et al. t EMBO Journal. 4.10 :2575-2581 (1985) have 
reported that the yield of GM-CSF has been increased 1n E. coTj by 
5 means of altering third position deoxyrlbocytoslne to 
deoxyriboadenlnes 1n the first ten of the 5* codons of the DNA 
encoding mature GM-CSF, and attribute the Increase 1n protein 
synthesis to the altered 5' end of the mRNA transcript. 

It would be desirable to obtain large amounts of G-CSF for 
10 development of therapies utilizing the granulocyte stimulating and 
granulocyte-macrophage stimulating characteristics of this colony 
stimulating factor. Furthermore, it would be desirable to obtain 
forms of G-CSF that lack NH 2 - terminal methionine when expressed 1n the 
recombinant host, and particularly 1n E. coll. 

15 The present Invention makes available G-CSF 1n large 

quantities by producing It at high levels in a number of hosts. 

The Invention discloses that a previously Isolated mRNA 
fraction from MIA PaCa-2 cells having high bone marrow proliferative 
activity encodes G-CSF. The Invention encompasses the G-CSF mRNA. In 
20 addition, the Invention encompasses a cDNA sequence made from this 
mRNA fraction that encodes the G-CSF gene. 

The Invention also Includes DNA sequences encoding mature G- 
CSF; these DNA sequences have been altered to facilitate expression of 
G-CSF in recombinant hosts. In addition, DNA sequence encoding mature 
25 G-CSF or a mutein thereof lacking NH 2 -terminal methionine when 
expressed in a recombinant host, are disclosed. 

At present, Investigators have encountered great difficulty 
1n obtaining significant expression of mature G-CSF in E. coll hosts 
under P L promoter gene N-RBS control or tryptophan (Trp) promoter 
30 control when the unaltered nucleotide sequence of the mature G-CSF 1s 
used. The Inventors have determined that, when under P L gene N-RBS 
control in ^ coll, steady state levels of mRNA encoding G-CSF are 
detectable by Northern analysis as described further herein. Under 
the control of the Trp promoter, no steady state G-CSF mRNA Is found 
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1n E. coll by the same method. Surprisingly, In light of the work of 
Delemarter, et al. which suggests that alterations of the 5* end of 
the GM-CSF gene affects secondary structure of the GM-CSF mRNA 
sequence and the translation thereof, the Inventors have found that 
5 alterations 1n the 5" region of the G-CSF gene leads not only to 
Increased expression of G-CSF protein 1n both P L gene N-RBS and Trp 
controlled expression vectors 1n E. coll, 1t also leads to Increased 
levels of mRNA steady state transcript 1n Trp controlled expression 
vectors 1n E. coll. 

10 Figure 1A 1s a plot of bone marrow proliferation activity 

obtained for Xenopus oocyte supernatants made from the fractionated 
MIA PaCa-2 mRNA. 

Figure IB 1s a dot blot of MIA PaCa-2 mRNA fractions with a 
CSF-l-spedflc probe. 

15 Figure 2 is a Northern blot of the RNA obtained from MIA 

PaCa-2 cells (lane 1), LD-1 cells (lane 2), and 5637 cells (lane 3), 
probed with two different 24-mer probes. In 2A the probe was 5'- 
ATGGCTGGACCTGCCACCCACAGC-3', which hybridized to RNA Of all three 
lanes. In 2B the oligomer probe was 5 ' -AGAAGCT GGTGA6T6AGT GTGCCA-3 ' . 

20 The nine nucleotides underlined appear only 1n the CHU-2 sequence that 
purports to be G-CSF (Nagata et al., supra). This probe did not . 
hybridize to any of the cell lines tested under stringent conditions. 

Figure 3A Is a plot of the bone marrow proliferation 
activity of the supernatants collected from Xenopus laevis oocytes 40 
25 hours after Injection of the pooled MIA PaCa-2 RNA fractions. 
Activity was measured by the murine bone marrow cell proliferation 
assay described in the examples below. 

Figure 3B Is a Northern blot of the pooled MIA PaCa-2 mRNA 
fractions using a G-CSF specific y 32 P labeled ollgodeoxyrlbonucleotide 
30 probe. 

Figure 4 shows the complete sequence of G-CSF cDNA obtained 
from the sequencing of the BamH I digest of plasmid pP12 and deduced 
amino acid sequence encoded thereby. The arrow Indicates the position 
of the nine nucleotide sequence GTGATGGAG of the Nagata et al. clone. 
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Figure 5 1s a schematic Illustration showing the prepartlon 
of plasmld pPDl and the placement of a Hindi 1 1 restriction site and 
AT6 codon 1n the G-CSF sequence. 

Figure 6 1s a schematic Illustration of the prepartlon of 
5 plasmld pPD2 1n which the 3* untranslated region of the G-CSF gene Is 
excised and transcription termination and mRNA stability are placed 
under control of the positive retroregulatory element. 

Figure 7 Is a schematic Illustration of the preparation of 
plasmld pJDl in which G-CSF expression 1s under the control of the P L 
10 promoter and gene N-r1bosome binding site. 

Figure 8 1s a schematic illustration of the preparation of 
pJD4, pJD4A and pJD4B. 

Figure 9 1s a schematic Illustration of the preparation of 

pPD5. 

15 Figure 10 1s a Northern blot of mRNA from pJDl and pPD2 

probed with a G-CSF specific ollgodeoxy nucleotide probe. 

Figure 11 shows the DNA sequence and N-term1nal amino acid 
sequence encoded by native G-CSF, pJD4B, pPD5 and pJD4A. The ATG 
codon encoding Met was added to native G-CSF and does not occur in the 
20 cDNA sequence obtained from MIA PaCa-2 transcripts. 

Figure 12 1s an SDS gel showing expression of G-CSF In 
Induced and uninduced E. coli host carrying pJD4A and pJD4B. 

Figure 13 is an SDS gel showing expression of G-CSF In 
Induced and uninduced E. coll host carrying pPD5. 

25 Figure 14 1s a schematic Illustration of the preparation of 

plasmld pPD6 1n which Vj-CSF expression is under control of the P|_ 
promoter and gene N-r1bosome binding site. 

Figure 15 Is an SDS gel showing expression of mG-CSF and Vj- 
mG-CSF. The Vj-mG-CSF mobllily reflects a slightly smaller size 
30 consistant with deletion of NHg-terminal Met and Thr. 
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A. G-CSF Cloning and Expression 

The G-CSF of the present Invention was detected In a 
purified mRNA fraction obtained by sucrose density gradient 
centrlfugation of Induced MIA PaCa-2 cells as described 1n PCT 
5 Publication No. W086/04607 published August 14, 1986 and assigned to 
the assignee of the present Invention. 

mRNA fractions obtained from the gradient were Injected Into 
Xenopus laevis oocytes; the oocytes were Incubated, and the 
supernatants tested for the; ability to stimulate murine bone marrow 

10 proliferation (bmp). Peak bmp activity was found In supernatants from 
oocytes that had been Injected with the 14, 15 and 16S MIA PaCa-2 mRNA 
fractions although peak hybridization to a CSF-1 specific probe was 
obtained with the 18S mRNA fraction. This phenomlnon is illustrated 
1n Figure 1A which shows a plot of bmp activity and CSF hybridization 

15 intensity per mRNA fraction. 

Further, Investigation by the present Inventors has 
determined that the peak bmp activity associated with the slower 
sedimentlng RNA fractions Is due to the activity of G-CSF as 1s shown 
1n Figure 3A and 3B. These figures show that peak hybridization of a 
20 G-CSF specific probe with MIA PaCa-2 mRNA fractions sedimentlng slower 
than 18S. Thus, the present invention encompasses, an mRNA fraction of 
human cells that Is capable of producing G-CSF 1n a host cell. 

"G-CSF" as used herein means a protein having the effect of 
stimulating the production of primarily granulocyte colonies or 
25 granulocyte-macrophage colonies 1n a colony forming assay using bone 
marrow cell progenitors of an appropriate species. A protein having 
this activity has the deduced amino acid sequence shown in Figure 4 
herein and is considered to be within the scope of the Invention. 

G-CSF according to the Invention may be Isolated from the 
30 MIA PaCa-2 cell Une. In addition, as shown in Figure 2, G-CSF 
specific RNA sequences are clearly detectable 1n the mRNA of LD-1 
cells and 5637 cells. Thus, G-CSF obtained by cloning from these cell 
lines as well as any other cell line which 1s G-CSF probe positive, Is 
considered within the scope of the invention. Mature G-CSF produced 



8 



0256843 



by the LD-1 cell line has been Isolated and ami no-acid sequenced. 65% 
of the LD-1 produced protein has a thr residue at the NH^- terminus and 
35% has a pro residue at the NH 2 -term1nus. 

As 1s the case for all proteins* the precise chemical 

5 structure of G-CSF depends on a number of factors. As lonlzable amino 
and carboxyl groups are present 1n the molecule, a particular protein 
may be obtained as an acidic or basic salt, or 1n neutral form. All 
such preparations which retain their activity when placed 1n suitable 
environmental conditions are Included 1n the definition. Further, the 

10 primary amino acid sequence may be augmented by derivatization using 
sugar moieties (glycosylatlon) or by other supplementary molecules 
such as lipids, phosphate, acetyl groups and the like, more commonly 
by conjugation with saccharides. The primary amino acid structure may 
also aggregate to form complexes. Certain aspects of such 

15 augmentation are accomplished through post-translatlonal processing 
systems of the producing host; other such modification may be 
Introduced in vitro. In any event, such modifications are Included in 
the definition so long as the activity of the protein, as defined 
above, 1s not destroyed. It is expected, of course, that such 

20 modifications may quantitatively or qualitatively affect the activity, 
either by enhancing or diminishing the activity of the protein In the 
various assays. 

Further, Individual amino acid residues in the chain may be 
modified by oxidation, reduction, or other derivatization, and the 
25 protein may be cleaved to obtain fragments which retain activity. 
Such alterations which do not destroy activity do not remove the 
protein sequence from the definition. 

Modifications to the primary structure Itself by deletion, 
addition, or alteration of the amino adds incorporated Into the 
30 sequence during translation can be made without destroying the 
activity of the protein. Such substitutions or other alterations 
result In proteins having an amino acid sequence which falls within 
the definition of proteins "having an amino add sequence 
substantially equivalent to that of G-CSF" . 
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For convenience, the mature G-CSF protein amino acid 
sequence shown in Figure 4, deduced from the cDMA clone Illustrated 
herein, Is designated mG-CSF (mature G-CSF) beginning at the amino 
acid residue threonine designated +1. Figure 4 shows the presence of 

5 a 30 residue putative signal sequence, which is presumably cleaved 
upon secretion from mammalian cells; mG-CSF 1s represented by amino 
acids 1-174 shown in that figure. Specifically included in the 
definition of human G-CSF are muteins which monomers and dlmers, if 
any, are G-CSF and related forms of G-CSF, designated by their 

10 differences from mG-CSF. G-CSF derived from other species may fit the 
definition of "human" G-CSF by virtue of its display of the requisite 
pattern of activity as set forth above with regard to human substrate. 

Also for convenience, the amino acid sequence of G-CSF will 
be used as a reference and other sequences which are substantially 

15 equivalent to this in terms of G-CSF activity will be designated by 
referring to the sequence shown in Figure 4. The substitution of a 
particular amino acid will be noted by reference to the number of the 
amino acid residue which it replaced. Thus, for example, ser 60 G-CSF 
refers to the protein which has the sequence shown in Figure 4 except 

20 that the amino acid at position 60 is serine rather than proline. 
Deletions are noted by a v followed by the numer of amino acids 
deleted from the N-terminal sequence, or by the number of amino acids 
remaining when residues are deleted from the C-terminal sequence, when 
the number is followed by a minus sign. Thus, v 4 -G-CSF refers to G- 

25 CSF of Figure 4 wherein the first four amino acids from the N-terminus 
have been deleted; v 130 - refers to G-CSF wherein the last 44 amino 
acids following amino acid 130 have been deleted. 

"Operably linked" or "in operable linkage" refers to 
juxtaposition such that the normal function of the components can be 
30 performed. Thus, a coding sequence "operably linked" to control 
sequences refers to a configuration wherein the coding sequence can be 
expressed under the control of these sequences. 

"Control sequences" refers to DNA sequences necessary for 
the expresion of an operably linked coding sequence in a particular 
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host organism. The control sequences which are suitable for 
procaryotes, for example, include a promoter, optionally an operator 
sequence, a RBS, and possibly, other as yet poorly understood, 
sequences- Eucaryotic cells are : known to utilize promoters, 
5 polyadenylation signals, and enhancers. 

"Expression system" refers to DNA sequences containing a 
desired coding sequence and control sequences in operable linkage, so 
that hosts transformed with these sequences are capable of producing 
the encoded proteins. In order to effect transformation, the 
10 expression system may be included on a vector; however, the relevant 
DNA may then also be integrated into the host chromosome. 

As used herein "cell", "cell line", and "cell culture" are 
used interchangeable and all such designations include progeny. Thus 
"transformants" or "transformed, cells" includes the primary subject 
15 cell and cultures derived therefrom without regard for the number of 
transfers. It is also understood that all progeny may not be 
precisely identical in DNA content, due to deliberate or inadvertent 
mutations. Mutant- progeny which . have the same functionality as 
screened for in the originally transformed cell, are included. Where 
20 distinct designation's are intended, ft will be clear from the context. 

Control Sequences and Corresponding Hosts 

Procaryotes most frequently are represented by various 
strains of E. coli. However * other microbial- strains may also be 
used, such as bacilli, for example Bacillus subtil is , various species 
25 of Pseudomonas, or other bacterial strains. In such procaryotic 
systems, pi asmid vectors which contain replication sites and control 
sequences derived from a species compatible with the host are used. 
For example, E. coli is typically transformed using derivatives of 
pBR322, a plasmid derived from an E. coli species by Bolivar, et al., 
30 Gene, 2^:95 (1977). . pBR322 contains genes for ampiciTlin and 
tetracycline resistance, and thus provides additional markers which 
can be either retained or destroyed in" constructing the desired 
vector. Commonly used procaryotic control sequences which are defined 
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therein to Include promoters for transcription Initiation, optionally 
with an operator, along with RBS sequences, Include such commonly used 
promoters as the beta-lactamase (penicillinase) and lactose (lac) 
promoter systems (Chang, et al., Nature (1977) 198:1056) and the 
tryptophan (Trp) promoter system (Goeddel, et al., Nucleic Acids Res. 
(1980) 8:4057) and the lambda derived P L promoter and N-gene rlbosome 
binding site (Shlmatake, et al., Nature (1981) 292:128), which has 
been made useful as a portable control cassette, as set forth 1n PCT 
Publication No. W085/03522 published August 15, 1985. However, any 
available promoter system compatible with procaryotes can be used. 

As described 1n detail herelnbelow, the native 6-CSF 
sequence has not proved to be effective for the production of G-CSF In 
such prokaryotlc microbial hosts at high levels. In the present 
invention, alterations have been carried out to the 5* end of the mG- 
CSF coding sequence. In addition to placing an ATG start codon at the 
5' end of the DNA sequence encoding mG-CSF, a number of changes have 
been made in the nucleotide sequence thereof without changing the 
amino acid sequence of the mG-CSF protein encoded thereby. 

These nucleotide changes encompass codons primarily encoding 
the first 10 amino acids of mG-CSF. At least one nucleotide 1n the 
third position of at least one of these codons has been changed from a 
deoxyrlboguanldine or deoxyrlbocytosine to a deoxyrlboadenlne. These 
changes are made without altering the amino acid residue encoded by 
the codon, and furthermore, the resulting altered codon 1s one which 
is not necessarily host-preferred, at least when the host cell 1s 
col i. More preferably, the third position codon alteration 1s carried 
out on at least one of the first 10 or 11 codons of mG-CSF wherein the 
5« amino acid sequence of mG-CSF remains 
ThrProLeuGlyProAl aSerSerLeuPro or MetThrProLeuGlyProAlaSerSerLeuPro, 
when the additional codon encoding an N- terminal Met 1s included. 

More preferably, the above-described G to A or C to A codon 
alterations are carried out on at least one of the first 4 or 5 codons 
of mG-CSF wherein the 5' amino acid sequence of mG-CSF remains 
ThrProLeuGly or MetThrProLeuGly when the additional codon encoding an 
N-terminal methionine Is included. 
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In a more preferred embodiment, the above-mentioned changes 
. to the third position nucleotide of codons encoding the 5' end of mG- 
CSF are carried out to at least three of the 10 codons of the DNA 
sequence encoding the amino acid sequence 

5 ThrProLeuGlyProAl aSerSerLeuPro. 

In an even more preferred embodiment, the above-mentioned 
changes to the third position nucleotide of codons encoding the 5* end 
of mG-CSF are carried out to at least three of the 4 codons of the DNA 
sequence encoding the amino acid sequence ThrProLeuGly. 

10 When the above-mentioned changes are made 1n the 5' end of 

the DNA sequence encoding mG-CSF that 1s 1n an E. coll compatible 
expression vector 1n operable linkage with the Trp promoter, both 
steady state mRNA transcripts encoding mG-CSF and a protein fraction 
having mG-CSF activity are readily detectable by Northern blotting and 

15 Xenopus laevls oocyte assay, bone marrow proliferation, or granulocyte 
colony stimulation assays, respectively. By contrast, the native mG- 
CSF DNA sequence 1n otherwise operable linkage with the Trp promoter 
yields neither mRNA transcript nor detectable by Northern blot nor a 
protein fraction having G-CSF activity 1n the above-mentioned assays. 

20 The native G-CSF DNA sequence In an expression vector 1n 

operable linkage with the P L promoter gene N-RBS, yields detectable 
steady state mRNA transcripts encoding G-CSF, but no protein fraction 
having mG-CSF. However, when the altered mG-CSF DNA sequence 
according to the Invention 1s placed 1n the same vector under the same 

25 promoter RBS control, a protein fraction Is produced having mG-CSF 
activity. 

The Invention also concerns muteins of G-CSF 1n which the N- 
termlnal methionine 1s not present when G-CSF Is produced from a 
recombinant host. In prokaryotlc recombinant hosts such as E. coll. 
30 the translation Initiation codon ATG encodes the amino add 
methionine. As a result, the amino or NH 2 -term1nus of recombinant 
eukaryotlc proteins produced In these recombinant hosts has a 
methionine residue that is generally not found in the protein when 
produced 1n the eukaryotic cell from which the DNA was Initially 
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obtained. This superfluous NH 2 -terra1na1 methionine, however, can be 
processed and removed by certain exopeptldase enzymes, most 
particularly methionine amino peptidase or met amino peptidase. This 
enzyme although capable of removing NH 2 -terminal amino acid residues, 
5 specifically does not remove the NH 2 - terminal methionine with equal 
efficiency from all proteins. In particular, the efficiency of the 
enzyme varies depending upon the Identity of the amino acid residue 
adjacent to the NH 2 -terminal methionine. 

The precise specificity of the enzyme in this regard cannot 
10 be determined without testing nearly an infinite number of substrates; 
however, a useful rule of thumb is set forth by Sherman, F. , et al., 
Bio Assays 3:27-31 (1985). Sherman et al. based their considerations 
for specificity of Met-amino-peptidases on the observed forms of 
mutants of iso-l-cytochrome-C from yeast and the published primary 
15 sequence of 82 mature intracellular proteins. They conclude that 
methionine is usually cleaved from residues with a side chain having a 
radius of gyration of 1.29 A or less, but generally not cleaved from 
residues with side chains larger than 1.43 A. This is consistent with 
the observation that mutationally altered 1so-l-cytochrome-C taken in 
20 consideration with other published sequences of other proteins from 
procaryotic and eucaryotic systems indicate that N-terminal methionine 
1s cleaved when it precedes residues of alanine, cysteine, glycine, 
proline, serine", threonine, or valine, but not when 1t precedes 
residues of arginine, asparagine, aspartic acid, glutamine, glutamic 
25 add, Isoleucine, leucine, lysine, or methionine. These results are 
generally consistent with those set forth in the illustrations 
below. However, some exceptions occur where the radius of gyration 
for the second amino acid is higher than 1.29 A and the secondary and 
tertiary structure or other conditions are particularly favorable. 
30 Therefore, this aspect of the specificity of the enzyme is intended as 
a general guideline and it should be borne in mind that even the 
specificity of particular amino peptidase have not been determined 
with exact precision, I.e., not all possible tertiary structures have 
been tested. Therefore, in order to fall within the definition of 
35 "Met-aminopeptldase", the enzyme needs only to meet the requirement of 
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specific cleavage of N- terminal methionine without cleavage of 
internal methionine residues and without cleavage of N-terminal 
residues other than methionine from any peptide. 

Among the NH 2 -terminal amino acid sequences that may be 
5 processed by amino terminal methionine peptidase, are those 1n which 
the methionine is followed by ala, gly or pro. In addition, NH 2 - 
terminal methionine followed by ala-met, gly-met, ala-ser, gly-gly, 
ala-pro, or pro-thr is expected to be processed. 

Thus, any of the form of G-CSF or muteins of G-CSF in which 

10 the above-mentioned sequences comprise the N-terminal portion of the 
amino acid will fall within the definition of G-CSF according to the 
invention so long as the biological activity of G-CSF is maintained. 
In particular, a deletion mutant of G-CSF, in which the threonine 
residue following the NH 2 -terminal methionine is deleted to yield a 

15 v^-G-CSF 1s illustrated in some detail hereinbelow. Specifically 
mutated forms of G-CSF in which the thr± residue is substituted with 
ala, or gly, ala-ser, gly-gly, or In which the order of pro and thr 
are reversed will also fall within the scope of the invention. DNA 
species encoding the above-indicated muteins of G-CSF may be made by 

20 site specific mutagenesis of the cDNA encoding native G-CSF using 
known techniques. Oligonucleotides encoding these muteins may also be 
substituted for DMA sequences encoding the native G-CSF amino terminal 
amino acid sequence through ligation of oligodeoxy ribonucleotides 
encoding the altered sequenced into specifically digested G-CSF DNA 

25 sequences. 

B. General Methods for Carrying out the Invention 

Transformations 

Depending on the host cell used, transformation is done 
using standard techniques appropriate to such cells. The calcium 
30 treatment employing calcium chloride, as described by Cohen, S. N. , 
Proc . Na tl , Acad . Sc1 . ( USA ) (1972) 69:2110, or the RbCl 2 method 
described in Maniatis, et al., Molecular Cloning: A Laboratory Manual 
(1982) Cold Spring Harbor Press, p. 254 was used for procaryotes or 
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other cells which contain substantial cell wall barriers. For 
mammalian cells without such cell walls, the calcium phosphate 
precipitation method of Graham and van der Eb, Virology (1978) 52:546 
may be used or the method of Wang, et al., Science, 228:149 (1985). 
5 Transformations into yeast are carried out according to the method of 
Van Solingen, P., et al., J. Bact. (1977) 130:946 and Hsiao, C. L., et 
al., Proc. Natl. Acad. Sci. (USA) (1979) 76:3829. 

Screening cDNA Libraries 

cDNA libraries are screened using the colony hybridization 

10 procedure. Lifts of colonies are made onto nitrocellulose filter 
papers (S & S type BA-85). The colonies are lysed and DNA fixed to 
the filter by treatment for five minutes with 0.5 M NaOH, 1.5 M NaCl , 
and are then washed twice for five minutes each time with 1.0 M Tris 
pH 8, 3 M NaCl. Filters are air dried and baked at 80°C for two 

15 hours. The duplicate filters are prehybridized at 45-50°C for one 
hour in 5 x SSC, 10 x Denhardt's solution (0.2% polyvinylpyrrolidone, 
0.2% Ficol'l, 0.2% BSA), 0.1% SDS, 50 mM sodium phosphate pH 7.0, and 
100 pg/ml tRNA. 

Hybridization of the filters is done in a solution similar 
20 to that described above for prehybridization, but also contains 10% 
dextran sulfate with kinased probe at 1 x 10 6 CPM/ml under conditions 
which depend on the stringency desired. Typical moderately stringent 
conditions employ temperatures of 45-50°C for 16-20 hours with 1-5 
ml/filter of DNA hybridization buffer containing probe. For higher 
25 stringencies higher temperatures are employed. The filters are washed 
three times for 15 minutes each time at appropriate temperatures using 
3 x SSC, 0.1% SDS, air dried, and are autoradiographed at -70°C for 
two to three days. 

Vector Construction 

30 Construction of suitable vectors containing the desired 

coding and control sequences employs standard ligation and restriction 
techniques which are well understood in the art. Isolated plasmids, 
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DNA sequences, or synthesized oligonucleotides are cleaved, tailored, 
and religated 1n the form desired. 

Site specific ONA cleavage 1s performed by treating with the 
suitable restriction enzyme (or enzymes) under conditions which are 
5 generally understood 1n the art, and the particulars of which are 
specified by the manufacturer of these commercial y available 
restriction enzymes. See, e.g., New England Blolabs, Product 
Catalog. In general, about 1 pg of plasmid or DNA sequence is cleaved 
by one unit of enzyme 1n about 20 yl of buffer solutions. In the 
10 examples herein, typically, a 3-10 fold excess of restriction enzyme 
is used to insure complete, digestion of the DNA substrate. Incubation 
times of about one hour to two hours at 37°C or other appropriate 
temperatures are workable, although variations can be tolerated. 
After each Incubation, protein 1s removed by extraction with 
15 phenol /chloroform, and may be followed by ether extraction, and the 
nucleic add recovered from aqueous fractions by precipitation with 
ethanol followed by running over a Sephadex G-50 spin column. If 
desired, size separation of the cleaved fragments may be performed by 
polyacyl amide gel or agarose gel electrophoresis using standard 
20 techniques. A general description of size separations 1s found 1n 
Methods in Enzymology (1980) 65:499-560 or Maniatls, Molecular 
Cloning: A Laboratory Manual , Cold Spring Harbor Laboratory, Cold 
Spring Harbor, NY (1982). 

Restriction cleaved fragments may be blunt ended by treating 
25 with the large fragment of E. coll DNA polymerase I (Klenow) 1n the 
presence of the four deoxy nucleotide triphosphates (dXTPs) using 
incubation times of about 15 to 25 minutes at 20 to 25°C 1n 50 mM Tr1s 
pH 7.6, 50 mM NaCl, 6 mM MgCl 2 . 6 mM DTT and 5-10 yM dXTPs. The 
Klenow fragment fills In at 5' overhangs but chews back protruding 3' 
30 single strands, even though the four dXTPs are present. If desired, 
selective repair can be performed by supplying only one of the, or 
selected, dXTPs within the limitations dictated by the nature of the 
sticky ends. After treatment with Klenow, the mixture is extracted 
with phenol /chloroform and ethanol precipitated followed by running 
35 over a Sephadex G-50 spin column. Treatment under appropriate 



17 



conditions with SI nuclease results In hydrolysis of any single- 
stranded portion. 

Synthetic oligonucleotides are prepared by the triester 
method of Matteucd, et al. (J. Am. Chem. Soc. (1981) 103:3185) or 

5 using conrnercially available automated oligonucleotide synthesizers. 
Kinasing of single strands prior to annealing or for labeling is 
achieved using an excess, e.g., approximately 10 units of 
polynucleotide kinase to 0.1 nmole substrate 1n the presence of 50 mM 
Tris, pH 7.6, 10 mM MgCl 2 , 5 mM dithiothreitol , 1-2 mM ATP, 1.7 pmoles 

10 Y-32P-ATP (2.9 mCi/mmole), 0.1 mM spermidine, 0.1 mM EDTA. 

Ligations are performed in 15-30 u l volumes under the 
following standard conditions and temperatures: 20 mM Tris-Cl pH 7.5, 
10 mM MgCl 2 , 10 mM DTT, 33 yg/ml BSA, 10 mM-50 mM NaCl, and either 40 
V M ATP, 0.01-0.02 (Weiss) units T4 DNA ligase at 0°C (for "sticky end" 
15 ligation) or 1 mM ATP, 0.3-0.6 (Weiss) units T4 DNA ligase at 14°C 
(for "blunt end" ligation), lntermol ecul ar "sticky end" ligations are 
usually performed at 33-100 pg/ml total DNA concentrations (5-100 nM 
total end concentration). lntermol ecul ar "blunt end" ligations 
(usually employing a 10-30 fold molar excess of linkers) are performed 
20 at 1 yM total ends concentration. 

In vector construction employing "vector fragments", the 
vector " fragment is commonly treated with bacterial alkaline 
phosphatase (BAP) in order to remove the 5' phosphate and prevent 
religation of the vector. BAP digestions are conducted at pH 8 in 
25 approximately 150 mM Tris, in the presence of Na + and Mg +2 using about 
1 unit of BAP per pg of vector at 60°C for about one hour. Vector 
fragments subjected to this treatment are referred to herein as 
"bapped". In order to recover the nucleic acid fragments, the 
preparation is extracted with phenol /chl oroform and ethanol 
30 precipitated and desalted by application to a Sephadex 6-50 spin 
column. Alternatively, religation can be prevented in vectors which 
have been double digested by additional restriction enzyme digestion 
of the unwanted fragments. 
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For portions of vectors derived from cDNA or genomic DNA 
which require sequence modifications, site specific primer directed 
mutagenesis 1s used. This 1s conducted using a synthetic 
oligonucleotide primer complementary to a single stranded phage DNA to 

5 be mutagenlzed except for limited mismatching, representing the 
desired mutation. Briefly, the synthetic oligonucleotide is used as a 
primer to direct synthesis of a strand complementary to the phage, and 
the resulting double-stranded DNA Is transformed into a phage- 
supporting host bacterium. Cultures of the transformed bacteria are 

10 plated In top agar, permitting plaque formation from single cells 
which harbor the phage. 

Theoretically, 50% of the new plaques will contain the phage 
having, as a single strand, the mutated form; 50% will have the 
original sequence. The resulting plaques are hybridized with klnased 

15 synthetic primer at a temperature which permits hybridizaton of an 
exact match, but at which the mismatches with the original strand are 
sufficient to prevent hybridization. Plaques which hybridize with the 
probe are then picked, cultured, and the DNA recovered. Details of 
site specific mutation procedures are described below in specific 

20 examples. 

Verification of Construction 

In the constructions set forth below, correct ligations for 
plasmid construction are confirmed by first transforming E. coli 
strain MM294 obtained from E. coli Genetic Stock Center, CGSC 6135, or 

25 other suitable host with the ligation mixture. Successful 
transformants are selected by ampiclllin, tetracycline or other 
antibiotic resistance or using other markers depending on the mode of 
plasmid construction, as is understood in the art. Plasmids from the 
transformants are then prepared according to the method of Clewell, D. 

30 B., et al., Proc. Natl. Acad. Sci. (USA) (1969) 62:1159, optionally 
following chloramphenicol amplification (Clewell, D. B., J. Bacteriol. 
(1972) UO : 667). The isolated DNA is analyzed by restriction enzyme 
mapping and/or sequenced by the dideoxy method of Sanger, F., et al., 
Proc. Natl. Acad. Sci. (USA) (1977) _74:5463 as further described by 
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Messing, et al., Nucleic Adds Res, (1981) 2:309, or by the method of 
Maxam, et al., Methods 1n Enzymology (1980) 65:499. 

Hosts Exemplified 

Host strains used in cloning and expression herein are as 

5 follows: 

For cloning and sequencing, and for expression of 
construction under control of most bacterial promoters, E. coli strain 
MM294 (supra), Talmadge, K. , et al., Gene (1980) 22:235; Messelson, 
M., et -al-., Nature (1968) 217_:1110, was used as the host. For 
10 expression under control of the P L N-RBS promoter, E. coli strain K12 
MC1000 lambda lysogen, N 7 N 53 cI857SusP 80 , ATCC 39531 (hereinafter 
sometimes referred to as MC1000-39513 XDG95 or DG95) is used. 

For M13 phage recombinants, E. coli strains susceptible to 
phage infection, such as E. coli K12 strain DG98 are employed. The 
15 0G98 strains has been deposited with ATCC July 13, 1984 and has 
accession number 1965. 

In addition to bacteria, eucaryotic microbes, such as yeast, 
may also be used as hosts. Laboratory strains of Saccharotnyces 
cerevisiae , Baker's yeast, are most used although a number of other 

20 strains are commonly available. While vectors employing the 2 micron 
origin of replication are illustrated, Broach, J. R., Meth. Enz. 
(1983) 101_:307, other plasmid vectors suitable for yeast expression 
are known (see, for example, Stinchcomb, et al.. Nature (1979) 282:39, 
Tschempe, et al., Gene (1980) 10:157 and Clarke, L., et al., Meth. 

25 Enz. (1983) 101:300). Control sequences for yeast vectors Include 
promoters for the synthesis of glycolytic enzymes (Hess, et al., 
Adv. Enzyme Reg. (1968) ]_:U9; Holland, et al., Biochemistry (1978) 
J7:4900). Additional promoters known in the art include the promoter 
for 3-phosphoglycerate kinase (Hltzman, et al., J. Biol. Chem. (1980) 

30 255:2073), and those for other glycolytic enzymes, such as 
glyceraldehyde-3-phosphate dehydrogenase, hexosekinase, pyruvate 
decarboxylase, phosphof rue toklnase, glucose-6-phosphate isomerase, 3- 
phosphoglycerate mutase, pyruvate kinase, triphosphate isomerase, 



phosphoglucose Isomerase, and glucoklnase. Other promoters, which 
have the additional advantage of transcription controlled by growth 
conditions are the promoter regions for alcohol dehydrogenase 2, 
Isocytochrome C, add phosphatase, degradative enzymes associated with 
5 nitrogen metabolism, and enzymes responsible for maltose and galactose 
utilization (Holland, Ibid). Alterations to the 5' end of the mG-CSF 
DNA sequence analogous to those shown to be effective with the Trp 
promoter and P L promoter may also be carried out for use with such 
eukaryotlc promoters. 

10 It is also believed terminator sequences are desirable at 

the 3' end of the coding sequences. Such terminators are found in the 
3' untranslated region following the coding sequences in yeast-derived 
genes. Many of the vectors illustrated contain control sequences 
derived from the enolase gene containing plasmid peno46 (Holland, M. 

15 J., et al., J. Biol. Chem. (1981) 256:1385) or the LEU2 gene obtained 
from YEpl3 (Broach, «]., et al., Gene (1978) 8:121), however any vector 
containing a yeast compatible promoter, origin of repldation and 
other control sequences is suitable. 

It is also, of course, possible to express genes encoding 

20 polypeptides in eukaryotlc host cell cultures derived from 
multicellular organisms as described further hereinbelow. See, 
generally for example, Tissue Culture, Academic Press, Cruz and 
Patterson, editors (1973). Useful host cell lines include murine 
myelomas N51, VERO and HeLa cells, COS-7 cells, and Chinese hamster 

25 ovary (CHO) cells. Expression vectors for such cells ordinarily 
include promoters and control sequences compatible with mammalian 
cells such as, for example, the commonly used early and late promoters 
from Simian Virus 40 (SV 40) (Fiers, et al., Nature (1978) 273:113), 
or other viral promoters such as those derived from polyoma, 

30 Adenovirus 2, bovine paplloma virus, or avian sarcoma viruses,.. .or 
immunoglobin promoters and heat shock promoters. General aspects of 
mammalian cell host system transformations have been described by 
Axel, U.S. Patent Mo. 4,399,216 Issued August 16, 1983. It now 
appears, also that "enhancer" regions are Important 1n optimizing 

35 expression; these are, generally, sequences found upstream of the 
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promoter region. Origins of replication may be obtained, If needed, 
from viral sources. However, Integration Into the chromosome Is a 
common mechanism for DNA replication In eucaryotes. Plant cells are 
also now available as hosts, and control sequences compatible with 
5 plant cells such as the nopal Ine synthase promoter and 
polyandenylatlon signal sequences (Deplcker,, A., et al., J. MoT. 
Appl. Gen. (1982) US61 ) are available. 

The Invention will be better Illustrated 1n relationship to 
the following examples which are Intended by the Inventors to be 
10 exemplary and non-1 1m1t1ng. 

Example I 

Isolation and Expression of cDNA Encoding Human 6-CSF 

A cDNA clone encoding human granulocyte stimulating factor 
was Isolated from the MIA PaCa-2 cell Hne, and was expressed using a 

15 recombinant vector In COS-7 cells. The MIA PaCa-2 cell Hne described 
1n Yunls, A. A. et al., Experimental Hematol, 12:838-843 (1984), Is an 
established cell line pubHcally available from the Cell Repository 
Lines (CRL) collection of the American Type Culture Collection, 12301 
Parklawn Avenue, Bethesda, MD 20895 under accession number ATCC CRL 

20 1420. The cDNA clone was sequenced and the corresponding amino add 
sequence was deduced. 

A. Initial Detection of mRNA From MIA PaCa-2 Cells Having High BMP 
Activity, But Low Hybridization to CSF-1 Probes — 

The human derived pancreatic carcinoma cell Hne MIA PaCa-2 
25 was used as a source of mRNA to validate CSF-1 specific probes and for 
the formation of a cDNA library containing an Intronless form of the 
human CSF-1 coding sequence. The MIA PaCa-2 cell Hne produces CSF-1 
at a level approximately 10 fold below that of the murine L-929 cells. 

Negative control mRNA was prepared from MIA PaCa-2 cells 

30 maintained 1n serum-free medium, I.e., under conditions wherein they 

do not produce CSF-1. Cells producing CSF-1 were obtained by 
relnducing CSF-1 production after removal of the serum. 
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Cells were grown to confluence 1n roller bottles using 
Dulbecco's Modified Eagles' Medium (DMEM) containing 10% fetal calf 
serum, and produce CSF-1 at 2000-6000. un1ts/ml. The cell cultures 
were washed, and relncubated 1n serum-free condition to suppress CSF-1 
5 formation. For negative controls, no detectable CSF-1 was produced 
after a day or two of Incubation without serum. Relnduced cells were 
obtained by addition of phorbol myrlstlc acetate (100 ng/ml ) to obtain 
production after several days of 1000-2000 units/ml . 

The MIA PaCa-2 mRNA was Isolated by lysis of the cell In 
10 Isotonic buffer with 0.5% NP-40 In the presence of ribonucleoslde 
vanadyl complex (Berger, S. L., et al., Biochemistry (1979) 18:5143) 
followed by phenol /chl oroform extraction, ethanol precipitation, and 
oligo dT chromatography, and an enriched mRNA preparation obtained. 
In more detail, cells are washed twice 1n PBS (phosphate buffered 
15 saline) and are resuspended 1n IHB (140 mM NaCl, 10 mM Tris, 1.5 raM 
MgCl 2 . P H 8 * containing 10 mM vanadyl adenosine complex (Berger, S. 
L., et. al., supra ). 

A non-1on1c detergent of the ethylene oxide polymer type 
(NP-40) Is added to 0.5% to lyse the cellular, but not nuclear 
20 membranes. Nuclei are removed by centrifugatlon at 1,000 x g for 10 
minutes. The post-nuclear supernatant is added to two volumes of TE 
(10 mM Tris, 1 mM ethyl enedi ami netetraacetic add (EDTA), pH 7.5) 
saturated phenol chloroform (1:1) and adjusted to 0.5* sodium dodecyl 
sulfate (SDS) and 10 mM EDTA. The supernatant is re-extracted four 
25 times and phase separated each time by centrifugatlon at 2,000 x g for 
10 minutes. The RNA 1s precipiated by adjusting the sample to 0.25 M 
NaCl, adding 2 volumes of 100% ethanol and storing at -20°C. The RNA 
1s pelleted at 5,000 x g for 30 minutes, 1s washed with 70% and 100% 
ethanol, and 1s then dried. Polyadenylated (polyA + ) messenger RNA 
30 (mRNA) 1s obtained from the total cytoplasmic RNA by chromatography on 
oligo dT cellulose (Aviv, J., et al., Proc. Natl. Acad. Sc1. (1972) 
69:1408-1412). The RNA 1s dissolved in ETS (10 mM Tris, 1 mM EDTA, 
0.5% SDS, pH 7.5) at a concentration of 2 rag/ml. This solution 1s 
heated to 65°C for five minutes, then quickly chilled to 4*C. After 
35 bringing the RNA solution to room temperature, It is adjusted to 0.4 M 
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NaCl and 1s slowly passed through an ollgo dT cellulose column 
previously equilibrated with binding buffer (500 mM NaCl, 10 mM Trls, 
1 mM EDTA, pH 7.5 0.5% SDS). The flow- through Is passed over the 
column twice more. The column 1s then washed with 10 volumes of 
5 binding buffer. PolyA + mRNA Is eluted with allquots of ETS, extracted 
once with TE-saturated phenol chloroform and 1s precipitated by the 
addition of NaCl to 0.2 M and 2 volumes of 100% ethanol. The RNA 1s 
repred pita ted twice, 1s washed once 1n 70% and then In 100% ethanol 
prior to drying. 

10 Total- mRNA was subjected to 5-20% by weight sucrose gradient 

centrlfugatlon 1n 10 mM THs HC1, pH 7.4, 1 mM EDTA, and 0.5% SDS 
using a Beckman SW40 rotor at 20°C and 27,000 rpm for 17 hours. The 
mRNA fractions were then recovered from the gradient by ethanol 
precipitation, and Injected Into Xenopus laevls oocytes 1n the 

15 standard translation assay. The oocyte products of the RNA fractions 
were assayed In the bone marrow proliferation assay of Moore, R. N. et 
al., J. Immunol. (ig83) 131; 2374, and of Prystowsky, M. B., et al., 
Am- J. Pathol * (1984) 114; 149 and the fractions themselves were 
assayed by dot blot hybridization to a 32-mer probe corresponding to 

20 the DNA 1n the second exon of the genomic sequence (exon II probe). 
These results are summarized 1n Figure 1A and B. 

The broken line In Figure 1A shows the response 1n the bone 
marrow proliferation assay of the supernatants from the Xenopus laevls 
oocytes; Figure IB shows the dot blot results. The most strongly 
25 hybridizing fraction, 11, corresponds to greater than 18S, while the 
most active fractions 8 and 9 correspond to 14-16S. 

The mRNA was also fractionated on a denaturing formaldehyde 
gel, transferred to nitrocellulose, and probed with exon II probe. 
Several distinct species ranging 1n size from 1.5 kb to 4.5 kb were 
30 found, even under stringent hybridization conditions. 

B. Isolation S-CSF mRNA From MIA PaCa-2 Cells 

Confluent MIA PaCa-2 cells were stimulated 1n serum free 
Dulbecco's minimum essential medium (DMEM) for 4 days with phorbol 
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myHstate acetate (50 ng/ml ) and retlnolc add (10 yM). The RNA was 
prepared as described by Chlrguln et a!.; briefly, the cells were 
lysed In 5 M guanldlne 1soth1ocyanate followed by centrlfugatlon 
through a 5.7 M cesium chloride (CsCl) cushion; poly A + RNA was 
5 prepared from the lysed cells by one selection cycle on ol1go (dT) 
cellulose as described 1n Manlatls et al., supra. 

In more detail, cells were lysed 1n a solution which 
contained 5 M guanldlne 1soth1ocyanate, 0.025 M Na-c1trate, pH 7, 0.5% 
sarcoslne and 8% p-mercaptoethanol . Molecular biology grade CsCl was 
10 made up to 5.7 M or 40* w/v and buffered with 0.02 M Tr1s pH 7.5 and 
0.002 M Na-EDTA. All solutions were prepared under RNase free 
conditions and passed through 0.45 y Mllllpore filters before use. 
The lysed cells were then layered onto SW28 ultracentrlfuge tubes 
which contained layers of 10 ml 5.7 M CsCl and 6 ml 40% CsCl. 

15 After centrlfugatlon at 26,000 rpm for 18 hours, the RNA 

pellet was dissolved 1n dH 2 0 and ethanol precipitated twice. 
Polyadenylated (PolyA + ) messenger RNA (mRNA) was obtained by 
chromatography of the total RNA on oligo (dT) cellulose as described 
1n Manlatls, supra (at page 197). 

20 The RNA 1s dissolved 1n sterile H 2 0 and heated to 65°C for 

five minutes. An equal volume of a solution containing 0.040 M Tr1s 
CI pH 7.6 1.0 M NaCl, 0.002 M EDTA and 0.2% SDS Is quickly added and 
the sample 1s cooled to room temperature. The sample Is then loaded 
on an ollgO (dT) column that has been equilibrated with a buffer 

25 containing 0.020 M Trls pH 7.6, 0.5 M NaCl, 0.001 M EDTA and 0.1% 
SDS. The flow through 1s collected, heated to 65°C, cooled to room 
temperature and passed over the column once more. The column 1s then 
washed with 10 volumes of wash buffer (0.02 Trls, 0.1 M NaCl, 0.001 M 
EDTA, 0.1S SDS). Poly A + mRNA 1s eluted with allquots of 0.01 M Trls 

30 pH 7.5, 0.001 M EDTA and ethanol -precipitated twice. 

334 ug of MIA PaCa-2 Poly A + mRNA was fractionated to 5-25% 
by weight sucrose gradient centrlfugatlon 1n 20 mM Trls HC1, pH 7.5, 1 
m EDTA, and 0.5% sarcoslne using a Becknian SW40 rotor at 20°C and 
27,800 rpm for 16 hours. The mRNA fractions were collected 1n 400 jil 
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fractions and ethanol precipitated twice. Fractions were pooled and 
resuspended In 15 yl H 2 0. 

Northern blots were prepared by the electrophoresis of 5 yg 
of poly A + RNA per lane or 1 pi of each pooled RNA fraction through 1% 

5 agarose gels containing 0.5 M formaldehyde followed by blotting onto 
nitrocellulose filters. After baking the filters for 1.5 hours at 
80°C, they were prehybrldlzed (5 x SSC, 10 x Denhardt's solution (0.2% 
polyvinylpyrrolidone, 0.2X Flcoll, 0.2* BSA), O.tt SDS, 50 mM sodium 
phosphate pH 7.0, and 100 yg/ml tRNA) for one hour at 55°C. The blots 

10 were hybridized for 16 hours at 55°C 1n a similar solution that also 
contained 10* dextran sulfate and 10 6 cpm per ml of an oligonucleotide 
probe labeled with y 32 P-ATP and polynucleotide kinase. The blot shown 
1n Figure 3B was washed at 55°C in 3 x SSC, O.W SDS. This 
oligonucleotide probe had the sequence 5 ' -GTAGGTGGCACACAGCTTCTCCTG-3 1 

15 and was designed based on the sequence of the CHU-2 cDNA clone 
described by Nagata et al., Nature, 319:415-418 (1986). 

The RNA pools that were described In the above assay were 
translated 1n a Xenopus laevls oocytes assay by the Injection of 50 nl 
of of RNA Into each oocyte as described In Gurdon et al., Nature, 
20 233 ; 177-180 (1971). Supernatants of 10 yl per oocyte were collected 
after 40 hours and assayed for CSF activity. 

The Xenopus laevls supernatants from each hybridization 
fraction were assayed for CSF activity 1n a murine bone marrow cell 
proliferation assay as described in Moore, et al., J . Immunol . , 

25 131:2374-2378 (1983). Briefly, in this assay, 5 x 10 4 murine bone 
marrow cells per/well were Incubated in 96-well plates (12 x 8) with 
serially diluted Xenopus laevls oocyte supernatants made from 
positively hybridizing mRNA fractions. After three days, 3 H thymidine 
(0.5 yd/well) was added, and after six hours the cells were harvested 

30 and counted In a liquid scintillation counter. 

Peak bone marrow proliferation was found In Xenopus laevls 
oocyte supernatants made with mRNA fractions that were most strongly 
positive In Northern blots with the above-described oligonucleotide 
probe as shown In Figure 3* 
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C. Identification of G-CSF Clones In a MIA PaCa-2 cDNA Library 

A cDNA library was prepared from the enriched MIA PaCa-2 
mRNA as described 1n Kawasaki, et al.> Science, 30:291-296 (1985). 
Briefly, the method used oligo (dT) priming of the poly A + tails and 

5 AMV reverse transcriptase employing the method of Okayama, H. et al., 
Hoi. Cell Biol., J; 280 ' 289 (1983) This method results 1n a higher 
proportion of full length clones than does poly (dG) tailing and 
effectively uses as host vector portions of two vectors therein 
described, and readily obtainable from the authors, pcDVl and pLl. 

10 The resulting vectors contain the Insert between vector fragments 
containing proximal BamH I and Xhol restriction sites; the vector 
contains the pBR322 origin of replication, and ampicillin resistance 
gene and SV40 control elements which result in the ability of the 
vector to effect expression of the Inserted sequences 1n COS-7 cells. 

15 A 1.2 x 10 6 clone library 1n E. coll obtained from the above 

enriched MIA PaCa-2 mRNA by the Okayama and Berg method was then 
probed using the same oligonucleotide probe that yielded a positive 
signal on the most active pooled MIA PaCa-2 mRNA fractions. To probe 
the library, E. coll containing the Okayama-Berg vectors were grown up 

20 on nutrient medium. Colonies were lifted onto nitrocellulose filter 
papers and were lysed. DNA was fixed to the filter by treatment for 
five minutes with 0.5 mM NaOH, 1.5 M NaCl. Filters were then washed 
twice for five minutes each time with 1.5 M Tris pH 8, 3 M NaCl and 
were air dried and baked at 80°C for two hours. 

25 The filters for the screening were prehybridlzed and 

hybridized to the y 32 P labeled probe as described above 1n Section B 
of the example, but both prehybridlzatlon and hybridization were 
carried out at 50°C. Plasmlds pP12 and pP28 were determined to be 
probe positive and were further characterized. 

30 D. Sequencing of G-CSF MIA PaCa-2 Plasmlds cDNA 

pP12 was digested with BamH I and subcloned into a M13mpl9 
vector and sequenced using the dideoxy chain-terml nation method. 
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The DNA sequence and predicted protein sequence of pP12 are 
shown In Figure 4. The cDNA Insert 1n pP12 Is 1510 base pairs long 
excluding the poly (dG) and poly (dA) tails; It contains 11 more bases 
of 5' untranslated sequence than the CHU-2 G-CSF clone. The major 

5 difference between this clone derived from MIA PaCa-2 and CHU-2 cDNA 
clones of Nagata et al. Is a 9 base pair insertion (GTGATGGAG) 1n the 
CHU-2 clone that would encode the amino acid residues Val-Ser-Glu just 
prior to cys-36 In the MIA PaCa-2 G-CSF as Indicated by the arrow In 
Figure 4. There are two other differences; an A at position 588 1n 

10 the MIA PaCa-2 clone (G In the CHU-2 clone) Is a silent third base 
change, and a T at position 1237 1n the MIA PaCa-2 clone (C 1n the 
CHU-2 clone) 1s In the 3' untranslated region. 

E. Activity of the pP12 and pP28 Proteins Produced 1n COS-7 Cells 

Plasmlds pP12 and pP28 were purified using a CsCl gradient 
15 and COS-7 cells were transfected using a modification (Wang, A. M., et 
al., Science, 228:149 (1985)) of the calcium phosphate copredpltation 
technique. After Incubation for three days, supernatants were 
collected and G-CSF production was assayed 1n a colony forming assay 
and the murine bone marrow proliferation assay, as described above. 

20 CSF activity was measured by both murine bone marrow cell 

proliferation and colony formation assays. For the colony assay, 1 x 
10 5 mononuclear bone marrow cells from Balb/c mice were used 1n a 
total of 1 ml, containing 51 cell supernatant, 15i fetal calf serum, 
and 0.3% agar in RPMI 1640 medium. Colonies of greater than 40 cells 

25 were counted after seven days, and cell type determined by 
cytocentrifuglng Individual colonies and staining with a modified 
Wright stain. Each unit of CSF stimulates the formation of one 
colony. For a control, COS-7 cells were mock transfected under the 
same conditions by carrying the cells through the transfectlon steps 

30 without exposure to a plasmld. The results of the assay are described 
below in Table I. 
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TABLE I 

Production of CSF Activity by Transfected COS-7 Cells 

Colony Assay 



Transfectlon Proliferation (cpm) Units/ml %6?_ %6M^ 

5 p pi2 4595 >280 58 42 

pp 2 8 6594 >280 83 17 

Mock 633 0 0 0 

a = Colonies containing only granulocytes. 

D = Colonies containing both granulocytes and macrophages. 

10 EXAMPLE II 

Cloning G-CSF for Expression 1n E. coli 

A. Preparation of G-CSF Sequence insertion of 5* Hindi 1 1 
and ATG codon "~ 

In order to facilitate manipulation of the G-CSF DMA 

15 sequence 1n various cloning and expression vectors, the G-CSF sequence 
was altered by Inserting a Hindlll site and ATG codon 1n mG-CSF 3' of 
the GCC codon encoding alanine (-1). 

Plasmid pP12 was digested with BamHI endonuclease and the 
digest was el ectrophoresed in a Tris-acetate, 0.5% low-melt1ng point 

20 agarose gel. The small fragment band carrying the G-CSF DNA sequence 
was melted and ligated Into BamHI -digested M13mpl9 using T4 DNA Hgase 
under sticky end conditions. After ligation, the mixture was used to 
transform E^_ coll strain DG98, ATCC accession number 39,768. The 
transformed cells were plated in the presence of 0.3 mM 

25 Isopropylthiogalactoslde (IPTG) obtained from Sigma Chemicals (St. 
Louis, MO) and 0.3 mM X-gal on a lawn of DG98 and grown at 37°C. Non- 
alpha complementary white plaques were grown in liquid broth and a 
sample of the culture was used to purify repllcative form (RF) DNA. 
The presence of the Insert was confirmed by BamHI digestion and sizing 

30 the fragment on a 0.7% agarose gal. Phage having the expected insert 
were designated pA3. 
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A Hlndlll site and ATG Initiation codon were placed In the 
G-CSF sequence adjacent to and 1n frame with the codon encoding the N- 
termlnal threonine of the mG-CSF sequence by means of slte-spedflc 
mutagenesis using a chemically synthesized purified 33-mer 
5 ol1godeoxyr1bonudeot1de having the following sequence: 

5'-GTG CAG GAA GCC AAG CTT ATG ACC CCC CTG GGC-3'. 

The Hindi 1 1 site Immediately upstream of the ATG codon 
permits the convenient cloning of the mG-CSF-encod1ng DNA sequence In 
subsequent manipulation. Approximately 10 pmoles (pM) of the 33-mer 

10 was hybridized to about 1 pM of single stranded DNA from pA3 1n about 
13 wl of 10 mM Trls pH 7.4, 90 mM NaCI, 10 mM MgCl 2 by heatlrig to 85°C 
for five minutes, followed by 45°C for 20 minutes. The annealed 
mixture was chilled on 1ce and adjusted to 18 v l by the addition of 
d1th1othr1etol to 10 mM, each dXTP to 0.5 mM and 5 units of DNA 

15 Polymerase I Klenow fragment. The reaction mixture was Incubated on 
1ce for 20 minutes followed by Incubation at room temperature for one 
hour. The repair reaction mixture was then used to transform E. coll 
strain DG98 as above, plated onto agar plates and incubated overnight 
to obtain phage plaques. 

20 Plaques were blotted using nitrocellulose filter circles and 

the filters were treated as described to lyse the cells, denature the 
DNA, neutralize, rinse, fix the DNA to the filter and incubate in 
prehybrldizatlon buffer. The 33-mer oligonucleotide was end labeled 
with y 32 P using polynucleotide kinase and hybridized to the filters at 

25 45°C overnight. Filters were washed and subjected to 
autoradiography. Probe positive plaques were- grown in liquid culture 
and characterized by BamHI Hindi II digestion and agarose gel sizing to 
confirm the presence of the expected fragments. One clone having the 
desired Insert was designated pApooL. pApooL was digested with BamH I 

30 and Hlndll I. The approximately 1500 base pair BamHI -Hindi 1 1 fragment 
was Isolated, purified, and retained for ligation Into a plasmid 
having the desired promoter and terminator sequences. 
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B. Placement of G-CSF Coding Sequence Under Trp Control 

To place the G-CSF coding sequence under Trp control and 
under the Influence of the BT positive retroregul atory element (PRE), 
the G-CSF coding sequence was cloned Into plasmld pAW703. pAW703 1s a 
5 plasmld having the Trp promoter and PRE In operable linkage with a 
Hindlll -BamH I fragment which contains the gene encoding human tumor 
necrosis factor (TNF). 

As described below 1n detail, pAW703 1s digested with BamHI 
and Hindi I I and the BamH I and Hlndlll fragment of pApool encoding 6- 
10 CSF is 11 gated in place of the TNF fragment. 

The following describes the construction of pAW703 and the 
cloning of G-CSF into this plasmid to make pPDl. An alternate 
construction using pTrp3 Is also described. 

B.l. Construction of pAW703 

15 Plasmld pAW703 was made from plasmids pAW701 and pAW711 as 

follows. Plasmid pAW711 (ATCC 39,918) which contains a 400 bp BamHI- 
Sall fragment encoding the PRE was digested with Sail and BamH I 
endonucleases under conditions suggested by the supplier to yield an 
approximately 400 bp fragment containing the PRE. The digest was 

20 loaded on a 1% agarose preparatory gel and the small Sal I -BamHI 
fragment containing the PRE was isolated, electroeluted and retained. 

Plasmid pAW701 was used to obtain the Trp control sequences 
for regulating expression of the gene. 

pAW701, a derivative of plasmid pTrp3, was made by digesting 
25 pTrp3 with BamH I and Hindi 1 1 endonucleases and ligation of the large 
Hindll I -BamHI fragment with Hindlll-BamHI fragment carrying the TNF 
gene. In the construction of pAW703, pAW701 was digested with BamH I 
and Sail endonucleases. The large BamHI -Sal I fragment of pAW701 
containing the Trp promoter HindHI site and TNF sequence was Isolated 
30 by electroelution from a preparative agarose gel, and was 11 gated to 
the small BamH I -Sal I fragment containing the PRE, under sticky end 
conditions using T4 DNA ligase to produce pAW703. 
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Both pAW703 and pApool were digested with BamHI and Hindi 1 1 
under conditions suggested by the manufacturer. The large Hindlll- 
BamHI fragment from pApooL was gel Isolated and 11 gated with the 
PAW703 digest using T4 DNA llgase. 

The ligation mixture was used to transform competent E. coll 
strain MM294. Transformants were screened for amplclllln 
resistance. Plasmlds from a number of amplclllln resistant colonies 
were Isolated, digested with Hindi II and BamH I and run on a 1.5% 
agarose gel with DNA sizing standards. One clone having the expected 
size fragments was designated pPDl. 

B.2. Trp Expression Vectors 

pTrp3 is a host expression vector containing a Trp promoter 
and RBS Immediately 5' of a Hindi II restriction site, thus permitting 
Insertion of a coding sequence having a Hindi 1 1 site Immediately 5' of 
a start codon under control of the Trp promoter. The backbone vector 
for pTrp3 Is pBR322. pTrp3 was deposited with the American Type 
Culture Collection, 12301 Parklawn Drive, Rockvllle, MD (ATCC) on 
December 18, 1984, and has accession number 39,946. 

B.2.A. Preparation of pTrp3 

To construct the host vector containing the Trp control 
sequences behind a Hindlll site, the Trp promo ter/operator/rlbosome 
binding site sequence, lacking the attenuator region, was obtained 
from pVH153, obtained from C. Yanofsky, Stanford University. Trp 
sequences are available In a variety of such plasmlds known In the 
art. pVH153 was treated with Hhal (which cuts leaving an exposed 3' 
sticky end just 5* of the Trp promoter) blunt ended with Klenow, and 
partially digested with TapJ. The 99 bp fragment corresponding to 
restriction at the Tajgl site, 6 nucleotides preceding the ATG start 
codon of Trp leader were Isolated, and then llgated to EcpRl 
(repair )/ClaI digested, pBR322 to provide pTrp3. 

An Improved form of pTrp3 also Includes a positive 
retroregulatory sequence In a position 1n the vector downstream of the 
coding sequence to be expressed as described herelnbelow. 
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An expression vector for G-CSF having the same control 
sequences as pAW703 may be made directly from P Trp3 (ATTC 39,940) and 
pAW711 (ATCC 39,918) and pP12 as follows. 

pTrp3 is digested with Hindi I I and Sail. The large Hlndlll- 
Sall fragment 1s Isolated by electrocution from a preparative gel. 
The small BamHI-Sall fragment containing the PRE 1s Isolated from 
P AW711 as described above and Ugated to the large Hindlll-Sall 
fragment of pTrp3 under sticky end conditions with T4 DMA Ugase. 
pPDl may also be made directly by combining the BamHI-Hlndlll fragment 
of pP12 with the above-mentioned fragments from pTrp3 and pAW711 and 
Ugating under sticky end conditions with T4 DNA Hgase. 



20 



C. Placing a BamHI Restriction Site Dow nstream of the G-CSF 
Translation Termination site 

The DNA sequence encoding G-CSF in pA3 contained 
15 approximately 930 nucleotides in the transcribed but untranslated 
region 3' of the translation stop codon TGA 1n the G-CSF sequence. In 
order to remove these untranslated codons and to place transcription 
termination and mRNA stability under the influence of the positive 
retroregulatory element, a BamHI site was placed by site-specific 
mutagenesis immediately 3' of the G-CSF translation termination 
signal.' 

A chemically synthesized purified 27-mer deoxy ribonucleotide 
was chemically synthesized using a commercial DNA synthesizer and had 
the following sequence: 

CCC AGC CCT GAG GAT CCA AGC CCT CCC 
Approximately 10 pM of the 27-mer was hybridized to about 1 
pM of single-stranded DNA from PpA3 in about 13 pi of 10 mM Tr1s pH 
7.4, 90 mM NaCl, 10 mM MgCl 2 by heating to 85°C for five minutes, 
followed by 45°C for 20 minutes. The annealed mixture was chilled on 
Ice and adjusted to 18 u l by the addition of dithiothreltol to 10 mM, 
each dXTP to 0.5 mM and 5 units of DNA Polymerase I Klenow fragment. 
The reaction mixture was Incubated on Ice for 20 minutes, followed by 
Incubation at room temperature for about one hour. 
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The reaction mixture was then used to transform coll 
strain DG98, plated onto agar plates and Incubated overnight to obtain 
phage plaques. Plaques were blotted using nitrocellulose filter 
circles arid the filters were treated as described to lyse the cells, 
5 denature the DNA, neutralize, rinse, fix the DNA to the filter and 
incubate In prehybrldlzatlon buffer. The 27-mer oligonucleotide was 
end labeled with oc y phosphate using polynucleotide kinase and 
hybridized to the filters at 45°C overnight. Filters were washed and 
subjected to autoradiography. One of the probe positive plaques was 
10 designated pBpool. 

pBpool was grown 1n liquid culture and digested with Apal 
and BaroH I endonucl eases under conditions specified by the 
manufacturer. The coding region of G-CSF has an Internal Apal site. 
This digestion generated an Apal -BamH I fragment of approximately 540 
15 base pairs, with the newly Inverted BamH I end Immediately 3' of the 
translation stop codon of the mG-CSF gene. 

Plasmid pPDl was also digested with Apal and BamH I under 
conditions suggested by the supplier. This digestion yielded an Apal- 
BamH I fragment 1n which the Apal site was the same as that described 
20 above and the BamH I site 1s 3' of the untranslated region of the G-CSF 
gene. 

The two digests were mixed at a molar excess of digested 
pBpool to pPDl and llgated under sticky end conditions using T4 DNA 
ligase. The ligation mixture was transformed into coll strain 
25 MM294. Transformants were screened for amplcillln resistance. 
Several colonies were Isolated and digested with Hindlll and BamHI. A 
colony having the correct length Hindi I I -BamHI Insert was designated 
pPD2. 

D. Placing mG-CSF Under Pj Control 

3° Plasmid pLAPHs Is a plasmid having the P L promoter and gene 

N-RBS controlling the expression of the Apha gene. The Aphe gene Is 
contained In a Hindi I I -BamHI fragment. The plasmid further contains 
the positive retroreguiatory element 3' to the Apha gene with a BamHI 
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site directly 5' of the PRE. Plasmld pLAphfi was made from plasmld 
pFC54.T (ATCC number 39,739). Plasmld pLAPHe Is exactly the same as 
pFC54.T except that the latter plasmld has a Hindi I I -BamHI fragment 
encoding des-alanyl-IL-2 1n place of the Aphe encoding fragment. 

i Thus, the following steps to place the expression of 6-CSF under 
temperature sensitive P L promoter-gene N-RBS and PRE control can also 
' be carried out using pFC54.T. The Hlndlll-BamHI fragment of pPD2 
having the ATG Initiation codon was placed under \ promoter control 
and PRE Influence as follows: 

3 Plasmld pLAPHs was digested with Hindlll and BamHI under 

conditions suggested by the supplier. Plasmld pPD2 was digested with 
the same endonucl eases. The digests were combined and 11 gated with T4 
DNA Hgase with the pPD2:pLAPHB digests at a 10:1 molar ratio. 

After ligation, the ligation mixture was used to transform 

5 competent E^ coM host strain XNDG95. Amplclllin resistant colonies 
were selected and plasmids were isolated therefrom, digested with 
BamH I and Hindi I I and a transformant having the expected Insert of 
approximately 540 bp was designated pJDl. 

EXAMPLE III 
20 Expression of mG-CSF In pPD2 and pJDl 

A. Growth and Induction of pPD2 1n T ransformed E. coli Strains 
MM294 and pJDl in Iransformed b. coli 1n XN uG95 

Initial attempts to produce mG-CSF using the two different 
expression vectors were not successful. The first vector, pJOl, 

25 (Figure 5) consisted of the DMA encoding the mature mG-CSF protein 
under the control of the bacteriophage x P L promoter and the X gene N- 
RBS. The Bacillus thuringlensls positive retroregulatlng element 
(PRE) 1s immediately downstream of the mG-CSF coding region. In 
addition, the origin of replication, derived from ColEl, contained 

30 mutations that conferred a temperature-sensitive Cop" phenotype. t± 
coU cells, strain DG95x (x N 7 N 33 C1837 sus P80), harboring pJDl were 
grown at 30°C to an optical density of 0.3 at 600 nm. The P L promoter 
and plasmid replication were then Induced by shifting the temperature 
to 42°C for three hours. 
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The second vector pPD2, consisted of DMA encoding the mG-CSF 
protein under the control of the E. coll Trp promoter. The Bacillus 
thuringlensis positive retroregulatlng element was Immediately 
downstream of the mG-CSF coding region. E. coll cells, strain 1*1294, 
5 were grown overnight 1n M9 medium plus 100 yg/ml of Trp and 50 yg/ml 
of ampicllHn. In the morning the cells were washed twice with M9 
medium and the Trp promoter was then Induced by setting up cultures at 
an optical density of 0.05 at 600 nm 1n M9 medium without Trp. The 
cells were grown at 37°C to an optical density of 0.8 at 600 nm. 

10 After Induction of pJDl and pPD2, extracts of Induced and 

unlnduced cells were prepared by boiling the cells for five minutes In 
Laemmll loading buffer (Laemmll, Nature, 227:680-685 (1970)) which 
contains sodium dodecyl sulfate (SDS) and 2-mercaptoethanol (2ME ). 
After electrophoresis through 12.5% polyacryl amide gels (PAGE), 

15 containing SDS and 2ME, and staining with Coomassle brilliant blue, an 
Induced protein band of the molecular weight corresponding to mG-CSF 
could not be detected. 

B. Steady State mG-CSF mRMA Level After Induction of Cells 
Harboring pJDl and pPD2 ~~ 

20 To determine the steady level of G-CSF transcripts 1n cells 

harboring pJDl and pPD2, RNA was prepared as follows: 20 ml of 
Induced or unlnduced cells was mixed with 20 ml of 1ce cold phosphate 
buffered saline. The cells were pelleted by centrlfugatlon at 4°C and 
resuspended 1n 3 ml of 0.15 M sucrose-20 mM sodium acetate. 0.3 ml of 

25 a 20 mg/ml lysozyme solution was added and the mixture was Incubated 
on Ice. 0.15 ml of 20% SDS was added and the mixture was extracted 
with equal volumes of phenol :chloroform (1:1) until the Interface was 
clear (about 4 times). After two ethanol precipitations, the nucleic 
acid mixture was resuspended 1n 100 yl of 20 mM Tr1s (pH 7.5J-10 mM 

30 MgCl 2 . Two microliters Of a 1 mg/ml solution of RNase-free DNase I 
was added and the mixture was Incubated at 37°C for 10 minutes. Five 
yl of 20% SDS was added and the samples were then phenol :chloroform 
(1:1) extracted twice and ethanol precipitated 2 times. 
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The Northern blotting analysis was performed as described 
using a modification of a published protocol (Maniatls et al., 
Molecular ClonlngtA Laboratory Manual. Cold Spring Harbor, N.Y. 
(1982)). Briefly, Northern blots were prepared by the electrophoresis 

5 of RNA through 1% agarose gels containing 0.5 M formaldehyde followed 
by blotting onto nitrocellulose filters. After baking the filters for 
1.5 hours at 80*C, they were prehybridized for one hour at 45°C 1n: 5 
x SSC, 0.2% polyvinylpyrrolidone, 0.2% F1coll. 0.2% bovine serum 
albumin, 0.1% SDS, 50 mM sodium phosphate pH 7.0, and 100 y g/ml 

10 tRNA. The blots were hybridized for 16 hours at 45°C 1n a similar 
solution that also contained dextran sulfate (10%) and 10 6 cpm per ml 
of the mG-CSF specific oligonucleotide S'-CGCTGCGCCATCGCCCTGGATCTT-S' 
that had been labeled with y 32 P ATP and polynucleotide kinase. The 
blots were washed 1n 3.2 M tetramethyl ammonium chloride for 15 minutes 

15 at 62°C. The bands corresponding to mG-CSF mRNA were then detected by 
autoradiography. 

Figure 10 shows that Induced cells harboring pJDl contained 
a mG-CSF mRNA of the expected size as well as smaller mG-CSF related 
mRNA while cells harboring pPD2 did not contain detectable mG-CSF 
20 mRNA. 

EXAMPLE IV 

A. Construction of a P t Vector with Alternative mG-CSF Codons 

Two ollgodeoxy ribonucleotides were chemically synthesized 
and purified and had the following sequences: A * 5'- 
25 AGCTTATGACACCATTAGGAC-3 1 , 6 ■ 5'-TAATGGTGTCATA-3' . 

DNA of pJDl was digested with the restriction enzymes 
Hindi I I and Apal and, after electrophoresis, the large fragment was 
Isolated from a Tr1s-acetate, 0.5% low-melting-point agarose gel 
(Crease et al., Methods In Enzymology . 101:78 (1983)). This fragment 
30 was mixed with 20 pmole of each of the above two oligonucleotides. 
This mixture was Ugated with T4 DNA Hgase, and used to transform 
competent E. coll cells of the strain DG95. The cells were plated on 
agar plates containing amplcillin and colonies containing recombinant 
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plasmlds (pJD4A and pJD4B) were Identified by colony hybridization 
(Manlatls et al. supra ) using oligonucleotide A which had been labeled 
with y 32 P ATP and polynucleotide kinase. As shown in Figure 11, pJD4A 
and pJD4B, both encode the same protein as pJDlj however, codons 2, 3, 
5 and 4 of pJD4A and pJD4B are different from the ones used 1n pJDl. In 
addition, codon 5 In pJD4B also differs from pJDl. This difference 
between pJD4A and pJD4B results from the fact that the end of the 
oligomer duplex that was llgated to the Apal site contained a single 
mismatch to the single-stranded end generated by the enzyme Apal * 

10 This type of mismatch will not Interfere with ligations (Hung and 
Wenslnk, Nucleic Adds Research. ,12:1863-1874 (1984)); however, 
depending on the repair of this mismatch, some of the resulting clones 
will regenerate the Apal site and have a pJDl-llke 5th codon and some 
will lose the Agal site and have a pJD4B-11ke 5th codon. The pJD4A 

15 and pJD4B constructions were verified by DNA sequence analysis. 

B. Expression of mG-CSF Protein by pJD4A and pJD4B 

coll harboring either pJD4A or pJD4B were induced and 
analyzed by SDS-PA6E as described for pJDl 1n Example III. Figure 12 
shows that an Inducible protein, with a molecular weight corresponding 
to that of mG-CSF, 1s present 1n E^ co]i cells that harbor either 
pJD4A or pJD4B. 
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C. 



Construction of a P, Vector With Alternative Codons Encoding ™ 
*mmo Methionine Peptidase hrocessable Amino Terminus of HC-BT 



Two oligonucleotides were chemically synthesized and 
25 purified and had the following sequences: 

C: 5 ' AGCTTAT6CCATTA66AC3 ' 
D: 3' ATAC6GTAAT 5' 

DNA of plasmld pJDl was digested with restriction enzymes 
Hlndlll and Apal and the large fragment was Isolated as described 1n 
IV.A. above. 20 pM each of oligonucleotides C and D were mixed with 
the large fragment, llgated with T 4 DNA llgase under sticky end 
conditions, and the llgated mixture was used to transform competent^ 
coli strain DG95. Cells were plated on agar containing amplclllln and 
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Amp resistant colonies containing a recombinant plasmid were 
Identified by hybridization with oligonucleotide C, which had been 
labeled with y 32 P ATP using polynucleotide kinase. The plasmld 
obtained from the probe positive colonies was designated pJD6 and 
5 encoded a G-CSF species In which the th^ 1s deleted, I.e., v^mG- 
CSF. The correct sequence was confirmed by dideoxy sequencing. 

D. Expression of mG-CSF Protein by pPD6 

E. coll DG95 harboring pPD6 was induced and analysed as 

described for pJDl In Example III. 
10 Figure 15 1s a picture of an SDS-PAGE gel of proteins 

produced by E. coli harboring pPD6 and pJD5. The cell clearly shows 

that pPD6 produce an Inducable protein with a slightly smaller 

molecular weight than MG-CSF produced by pJD5. 

The protein band produced by pPD6 was isolated and purified 
15 from a large scale Induction of pPD6. The purified protein was 

sequenced using an automated amino acid sequencer and showed that the 

NH 2 - terminus of this protein 1s pro rather than met or thr. 

EXAMPLE V 

A. Construction of a Trp Expression Vector Containing Alternative 
20 mG-CSF coaons 

DNA from pJD4A was digested with the restriction enzymes 
Hindi 1 1 and BamHI. The small fragment, containing the coding region 
for mG-CSF, was isolated from Tris-acetate 0.5% low-melting-point 
agarose gel. DNA from pPD2 was digested with the restriction enzymes 

25 Hindi 1 1 and BamH I and the large fragment was isolated from a Tris- 
acetate 0.5% low-melting-point agarose gel. These two fragments were 
llgated together with T4 DNA Hgase and used to transform competent E^ 
coli of the strain MM294. An ampiclllin resistant colony containing 
the recombinant was picked and the construct was verified by DNA 

30 sequence analysis. The resulting plasmld was designated pPD5. 
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B* Expression of mG-CSF In pPD5 

E. coll harboring pPDS was Induced and analyzed by SDS-PAGE 
as described for pPD2 1n the previous example. Figure 13 shows that 
an Inducible protein, with a molecular weight corresponding to that of 
5 mG-CSF, was present In E. col 1 cells that harbor pPD5. 

C. Using Alternative mG-CSF Codons Increases the St eady State Level 
of J-CSF mftHA' : *" 

The steady state level of mG-CSF mRNA 1n Induced cells 
harboring either pPD2 or pPD5 was compared by Northern analysis as 
10 described above. Figure 14 shows that induced cells harboring pP05 
contain a mG-CSF transcript of the expected size, while no mG-CSF 
transcripts were detected In Induced cells harboring pPD2. 

The following plasmlds have been deposited In the American 
Type Culture Collection, Rockvllle, MD USA: 



15 Plasmld Strain Date Accession # 

pJD4A E. coll K12 DG95X 8/12/86 67,181 

pJD4B T7 coTT K12 DG95x 8/12/86 67,183 

pPD5A 17 coTT K12 MM294 8/12/86 67,182 

pAWll 17 coTT K12 DG95X 11/8/84 39,918 

20 pTrp3 I7 coTT HC1000 12/18/84 39,946 

pFC54.t I7coTTDG95x 8/7/84 39,789 



These deposits were made under the provisions of the 
Budapest Treaty on the International Recognition of the Deposit of 
Microorganisms for the Purposes, of Patent Procedure and the 

20 Regulations thereunder (Budapest Treaty). This assures maintenance of 
a viable culture for 30 years from date of deposit. The deposits will 
be made available by ATCC under the terms of the Budapest Treaty, and 
subject to an agreement between applicants and ATCC which assures 
permanent and unrestricted availability upon Issuance of the pertinent 

25 U.S. patent. The Assignee herein agrees that If the culture on 
deposit die or be lost or destroyed when cultivated under suitable 
conditions, It will be promptly replaced upon notification with a 
viable speclment of the same culture. Availability of the deposits 1s 
not be be construed as a license to practice the Invention 1n 
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contravention of the rights granted under the 
authority of any government in accordance with its 

patent laws. 

The invention includes the use of G-CSF as 

defined above or a mutein as defined above in 

preparing a medicament, e.g. in preparing a 

pharmaceutical or veterinary formulation. 

In another aspect the invention includes a 
pharmaceutical or veterinary formulation comprising 
G-CSF as defined above or a mutein as defined above 
formulated for pharmaceutical or veterinary use 
respectively, optionally the formulation being in 
unit dosage form and/or also comprising an acceptable 
diluent, carrier or excipient. 

The inventors have disclosed G-CSF and means 
for the efficient expression thereof. Variations 
within the scope of the invention may be made by 
those ordinarily skilled in the art without departing 
from the essence of the invention as claimed herein. 
The illustrative embodiments given hereinbefore are 
not limiting. 
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CLAIMS: 

1. A modified DMA sequence encoding 6-CSF, said G-CSF 
having the effect of stimulating the production of primarily 
granulocyte or granulocyte-macrophage colonies 1n colony forming assay 
5 using bone marrow cell progenitors of an approprltate species, wherein 
the 5' region thereof comprises deoxyrlboadenlne substituted for the 
deoxyrlboguanlne or deoxyrlbocytoslne 1n the third position of at 
least one codon wherein said substitution does not change the amino 
acid sequence of G-CSF. 

o 

10 2. The DNA sequence according to claim 1 wherein said 5* 

region comprises a DNA sequence encoding the amino acid sequence Thr 
Pro Leu Gly or Thr Pro Leu Gly Pro Ala Ser Ser Leu Pro. 

3. The DNA sequence according to claim 2 wherein at least 3 
codons have said substitution. 

15 4. The DNA sequence according to claims 1, 2 or 3 further 

comprising deoxyribothymidine substituted for deoxyrlbocytoslne 1n the 
first position of at least one codon. 

5. A modified DNA sequence encoding mature G-CSF wherein 
the 5' region thereof comprises codons selected from the group 

20 consisting of ACA CCA TTA GGA, ACA CCA TTA, ATG ACA CCA TTA GGA and 
ATG ACA CCA TTA. 

6. A modified DNA sequence encoding a muteln of G-CSF, said 
muteln of G-CSF having the effect of stimulating the production of 
primarily granulocyte or granulocyte-macrophage colonies 1n colony 

25 forming assays using bone marrow cell progenitors of an appropriate 
species, wherein the muteln does not have an NH 2 -term1nal methionine 
when produced from a recombinant host. 
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7. The modified DNA sequence encoding a muteln of G-CSF of 
claim 6 wherein the NH 2 - terminal end of said muteln encodes a sequence 
that 1s processed by met amino peptidase. 

8. The modified DNA sequence encoding a muteln of G-CSF of 
5 claim 7 wherein the DNA sequence encodes an amino terminal end 

selected from the group consisting of met-ala, met-gly, met-pro, met- 
al a-met, met-gly-met, met-ala-ser, met-ala-pro, met-pro- thr, and met- 
pro-leu. 

9. An expression vector comprising the DNA sequence of 
10 claims 1 to 8 1n operable linkage with a promoter and ribosome 

binding site. 

10. The expression vector of claim 9 in operable linkage 
with a promoter-ribosome binding site selected from the group 
consisting of P L promoter and ribosome binding sites operable 

15 therewith and Trp promoter-ribosome binding site. 

11. An E. coli host transformed with the expression vector 
of claim 9 or claim 10. 

12. A culture of E. coli transformed with the expression 
vector of claim 10 having a G-CSF content of at least about 3% of 

20 total cell protein to about 101 of total cell protein. 

13. Human recombinant G-CSF. 

14. An mRNA fraction isolated from human cells that encodes 
human G-CSF. 

15. The expression vector of claim 10 comprising pJD4 or 

25 pPD5. 
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16. A rauteln of G-CSF, said muteln of 6-CSF having the 
effect of stimulating the production of primarily granulocyte or 
granulocyte-macrophage colonies In colony forming assays using bone 
marrow cell progenitors of an appropriate species wherein said muteln 
does not have an NH 2 - terminal methionine. 

17. The muteln of claim 16 wherein the NH 2 -term1nal end of 
said muteln has an amino acid, sequence that 1s processed by met amino 
peptidase In a recombinant host. 

18. The muteln of G-CSF of claim 17 wherein the amino 
terminal end Is selected from the group consisting of met-ala, met- 
gly, met-pro, met-ala-met, met-gly-met, met-ala-ser, met-ala-pro, met- 
pro-thr and met-pro-leu. 

19. The muteln of G-CSF of claim 18 which yields Vj-mG-CSF 
when expressed In a recombinant host. 

20. A muteln of mG-CSF selected from the group consisting 
of alaj-mG-CSF, gly 1 -mG-CSF, prOj-mG-CSF, pro 1 -thr 2 -mG-CSF, ala 1 -met 2 - 
mG-CSF, gly^-metjrmG-CSF, ala^-se^-mG-CSF, ala^-prOjj-mG-CSF, proj- 
leu 2 -mG-CSF and Vj-mG-CSF. 

21. Vj-mG-CSF. 
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22. A method of producing a modified DNA 
sequence encoding G-CSF, said G-CSF having the effect 
of stimulating the production of primarily 
granulocyte or granulocyte-macrophage colonies in 
colony forming assay using bone marrow cell 
progenitors of an appropriate species, which method 
comprises substituting deeoxyriboadenine at the 5' 
region of the sequence for the deoxyriboguanine or 
deoxyribocytosine in the third position of at least 
one codon wherein said substitution does not change 
the amino acid sequence of G-CSF. 

23. A method according to claim 22 and 
further defined by the specific feature(s) of any one 
or more of claims 2 to 4. 

24. A method of making a modified DNA 
sequence encoding mature G-CSF which includes 
selecting codons for the 5» region thereof from the 
group consisting of ACA CCA TTA GGA, ACA CCA TTA, ATG 
ACA CCA TTA GGA and ATG ACA CCA TTA. 

25. A method of producing a modified DNA 
sequence encoding a mutein of G-CSF, said mutein of 
G-CSF having the effect of stimulating the production 
of primarily granulocyte or granulocyte -macrophage 
colonies in colony forming assays using bone marrow 
cell progenitors of an appropriate species, wherein 
the mutein does not have an NH^-terminal methionine 
when produced from a recombinant host, comprising 
modifying the NH 2 -terminal sequence encoding 
portion of a DNA sequence encoding G-CSF, for example 
by subjecting cDNA encoding native G-CSF to site 
specific mutagenesis to produce an NH 2 >terminal 
sequence wherein the NH 2 -terminal methionine is 
susceptible to enzymatic cleavage or ligating an 
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oligonucleotide encoding such an Nl^-terminal 
sequence to a specifically digested 6-CSF DNA 
sequence. 

26. A method according to claim 25 and 
further defined by the specific feature of claim 7 or 
of claim 8. 

27. A process for preparing an expression 
vector which comprises forming an operable linkage 
between a promoter and ribosome binding site and a 
DNA sequence as defined in any one of claims 1 to 8, 
the promoter-ribosome binding site preferably being 
selected from the group consisting of P L promoter 
and ribosome binding sites operable therewith and Trp 
promoter-ribosome binding site. 

28. A method of making an E. coli cell 
capable of expressing 6-CSF which comprises 
transforming an E. coli host with an expression 
vector as defined in claim 9, claim 10 or claim 15. 

29. A process for the preparation of 
recombinant G-CSF which comprises culturing an E. 
coli transformed host as defined in claim 11 or a 
culture as defined in claim 12 or E. coli cells made 
by the method of claim 28 so as to permit expression 
of the E. coli genome. 

30. A process according to claim 29 wherein 
the G-CSF prepared thereby is as defined in any one 
of claims 16 to 21. 

31. A pharmaceutical or veterinary 
formulation comprising G-CSF as defined in claim 13 
or a mutein as defined in any one of claims 16 to 21 
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formulated for pharmaceutical or veterinary use 
respectively, optionally the formulation being in 
unit dosage form and/or also comprising an acceptable 
diluent, carrier or excipient. 

32. The use of G-CSF as defined in claim 13 
or a mutein as defined in any one of claims 16 to 21 
in preparing a medicament, e.g. in preparing a 
pharmaceutical or veterinary formulation as defined 
in claim 31. 
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FIG. 4 

1 AAAAAACAGCCCGGAGCCTGCAGCCCAGCCCCACCCAGACCC ATGGCTGGACCTcr.r.xrr 

MetAiaGiyProAlaThr 
-30 

6 1 CAGAGCCCCATGAAGCTGATGGCCCTGCAGCTGCTGCTGTGGCACAGTGCACTCTGGACA 
GlnSerProMetLysLeuMetAlaLeuGlnLeuLeuteuTrpHisSerAlaLeoTrpThr 
-20 -10 

121 GTGCAGGAAGCCACCCCCCTGGGCCCTGCCAGCTCCCTGCCCCAGAGCTTCCTGCTCAAG 
ValGlnGluAlaThrProLeuGlyProAlaSerSerLeuProGlnSerPheLeuLeuLvs 
-11 10 

1 81 TGCTTAGAGCAAGTGAGGAAGATCCAGGGCGATGGCGCAGCGCTC CAGGAGAAGCTGTGT 
CysLeuGluGlnValArgLysIleG InG ly AspG ly Al aAl aL e uGlnGluLvsLeuCvs 
20 ' 30 * 

241 GCCACCTACAAGCTGTGCCACCCCGAGGAGCTGGTGCTGCTCGGACACTCTCTGGGCATC 
AlaThrTyrLysLeuCysHisProGluGluLeuValLeuLeuGlyHisSerLeuGlylle 

301 CCCTGGGCTCCCCTGAGCAGCTGCCCCAGCCAGGCCCTGCAGCTGGCAGGCTGCTTGAGC 
ProTrpAlaProLeuSerSerCysProSerGlnAlaLeuGlnLeuAlaGlyCysLeuSer 
60 70 

361 CAACTCCATAGCGGCCTTTTCCTCTACCAGGGGCTCCTGCAGGCCCTGGAAGGGATCTCC 

GlnLeuHisSerGlyLeuPheLeuTyrGlnGlyLeuLeuGlnAlaLeuGluGlylleSer 
80 90 

421 CCCGAGTTGGGTCCCACCTTGGACACACTGCAGCTGGACGTCGCCGACTTTGCCACCACC 

ProGluLeuGlyProThrLeuAspThrLeuGlnLeuAspValAlaAspPheAlaThrThr 
100 110 

481 ATCTGGCAGCAGATGGAAGAACTGGGAATGGCCCCTGCCCTGCAGCCCACCCAGGGTGCC 
IleTrpGlnGlnMetGluGluLeuGlyMetAlaProAlaLeuGlnProThrGlnGlyAla 

541 ATGCCGGCCTTCGCCTCTGCTTTCCAGCGCCGGGCAGGAGGGGTCCTAGTTGCCTCCCAT 

MetProAlaPheAlaSerAlaPheGlnArgArgAlaGlyGlyValLeuValAlaSerHis 
140 " 150 

601 CTGCAGAGCTTCCTGGAGGTGTCGTACCGCGTTCTACGCCACCTTGCCCAGCCCTGAGCC 
LeuGlnSerPheLeuGluValSerTyrArgValLeuArgHisLeuAlaGlnPro 
160 170 

661 AAGCCCTCCCCATCCCATGTATTTATCTCTATTTAATATTTATGTCTATTTAAGCCTCAT 
721 ATTTAAAGACAGGGAAGAGCAGAACGGAGCCCCAGGCCTCTGTGTCCTTCCCTGCATTTC 
7 81 TGAGTTTCATTCTCCTGCCTGTAGCAGTGAGAAAAAGCTCCTGTCCTCCCATCCCCTGGA 
841 CTGGGAGGTAGATAGGTAAATACCAAGTATTTATTACTATGACTGCTCCCCAGCCCTGGC 
901 TCTGCAATGGGCACTGGGATGAGCCGCTGTGAGCCCCTGGTCCTGAGGGTCCCCACCTGG 
961 GACCCTTGAGAGTATCAGGTCTCCCACGTGGGAGACAAGAAATCCCTGTTTAATATTTAA 
1021 ACAGCAGTGTTCCCCATCTGGGTCCTTGCACCCCTCACTCTGGCCTCAGCCGACTCCACA 
10 81 GCGGCCCCTGCATCCCCTTGGCTGTGAGGCCCCTGGACAAGCAGAGGTGGCCAGAGC^GG 
1141 GAGGCATGGCCCTGGGGTCCCACGAATTTGCTGGGGAATCTCGTTTTTCTTCTTAAGACT 
1201 TTTGGGACATGGTTTGACTCCCGAACATCACCGACGTGTCTCCTGTTTTTCTGGGTGGCC 
1261 TCGGGACACCTGCCCTGCCCCCACGAGGGTCAGGACTGTGACTCTTTTTAGGGCCAGGCA 
1321 GGTGCCTGGACATTTGCCTTGCTGGACGGGGACTGGGGATGTGGGAGGGAGCAGACAGGA 
13 81 GGAATCATGTCAGGCCTGTGTGTGAAAGGAAGCTCCACTGTCACCCTCCACCTCTTCACC 
1441 CCCCACTCACCAGTGTCCCCTCCACTGTCACATTGTAACTGAACTTCAGGATAATAA AGT 
1501 GTTTGCCTCCAAAAAAAAAAAAAAA 
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1 2 3 4 5 

MET THR PRO LEU GLY 

G-CSF CODONS ATG ACC CCC CTG GGC 

PJD4B, pPD5 ATG ACA CCA TTA GGA 

pJD4A ATG ACA CCA TTA GGC 
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